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NUCLEIC ACID SUPPORTED PROTEIN COMPLEMENTATION 

FIELD OF THE INVENTION 

[001] This application claims benefit under 35 U.S.C. §1 19(e) of USSN 
60/417,323 filed October 9, 2002 and USSN 60/478,803 filed June 16, 2003. 

[002] The present invention provides a method for in vivo and in vitro detection of 
nucleic acid targets, including DNA and RNA targets. 

BACKGROUND OF THE INVENTION 

[003] There is a great demand for methods to detect and quantify the presence of a 
target nucleic acid sequence of interest. In particular, there is a great demand for 
methods that can be used to detect and quantitate the presence of a target nucleic acid 
in vivo and in vitro. 

[004] For example, detection of RNA in vivo is of particular interest. During the 
last several years, knowledge on multiple functional roles of RNAs in gene expression 
has substantially increased, as has the number of newly isolated RNA molecules. 
Additionally, it has become clear that mRNA localization plays an important role in 
directing specific proteins to their correct position within the cell, thus regulating gene 
expression and cell development (Johnston, 1995; Oleynikov&Singer, 2003). To 
fully understand functions of various RNAs, methods of studying their dynamic 
behavior within living cells are required. 
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[005] Different strategies for labeling RNA molecules for in vivo detection have 
been used (for a review, see Pederson, 2001). Most methods rely on different RNA- 
specific hybridization probes and microinjection or lipofection for probe delivery to 
the nucleus or cytoplasm. For example self-ligating quenching probes were 
successfully used by Kool and co-workers for RNA hybridization within bacterial 
cells (Sando & Kool, 2002). Fluorescent 2-O-methyl-RNA probes have been shown 
to be more stable than unmethylated oligonucleotides and have been used by several 
groups (Carma-Fonseca et al 9 1999; Molenaar et al. y 2001). Molecular beacons, 
(Sokol et aL, 1998; Perlette &Tan, 2001), RNA hybridizing oligonucleotide probes 
, capable of fluorescence resonance energy transfer (FRET) (Matsuo, 1998; Tsui et aL, 
2000; Tsuji et al, 2001; Sei-Iida et aL, 2000) and caged-fluorescein labeled antisense 
oligonucleotides present different variants of the same approach (for a review see 
Politz, 1999, Pederson, 2001). 

[006] However, all of these approaches are probe-based, and thus are limited by 
the low sensitivity of hybridization due to the low concentration of RNA within the 
cell. In most cases, only highly abundant RNA species can be detected. Thus, 
because of their abundance, it has been possible to detect P-actin mRNA, c-fos 
mRNA, basic fibroblast growth factor RNA, and total poly (A)-RNA). 

[007] Another obstacle of using oligonucleotide probes for RNA detection in vivo 
is their fast accumulation in the nucleus (Tsuji et aL, 2000; Molenaar et aL 9 2001). 
One approach to overcome this difficulty has been the use short oligonucleotide 
hybridization probes bound to streptavidin via biotin-straptavidin interactions, to 
lower their passage through nuclear pores (Tsuji et aL, 2000). 
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[008] Yet another strategy to study RNA in vivo has been to generate fusions 
between functionally important RNA-binding proteins and fluorescent proteins, which 
allows RNA kinetic studies. For example, real time RNA movement in yeast cells 
was monitored by co-expressing a GFP-MS2 fusion protein and an ASH1 RNA 
reporter fused to MS2 binding sequences (Bertrand et al> 1998; Beach et al., 1999; 
01eynikov& Singer, 2003). In these studies, GFP or its spectral variants were used as 
fluorescent tags. Thus, no signal amplification was generated in this approach, which 
again restricts the method to detection of only highly abundant RNAs. Another 
drawback of this approach is the background signal of the expressed fluorophore- 
protein chimera, which substantially limits the sensitivity of the method. 

[009] The majority of detection methods have relied on the use of nucleic acid 
probes to detect a nucleic acid of interest. Probe-based assays are useful in the 
detection, quantitation and analysis of nucleic acids. Nucleic acid probes have long 
been used to analyze samples for the presence of nucleic acid from bacteria, fungi, 
virus or other organisms and are also useful in examining genetically-based disease 
states or clinical conditions of interest. Nonetheless, probe-based assays have been 
hampered in part by difficulties associated with specificity, sensitivity and reliability, 
and the challenges presented by detection of nucleic acids in vivo. 

[0010] Nucleic acid hybridization is a fundamental process in molecular biology. 
Sequence differences as subtle as a single base (point mutation) in very short 
oligomers (<10 base pairs "bp") can be sufficient to enable the discrimination of the 
hybridization to complementary nucleic acid target sequences as compared with non- 
target sequences. However, nucleic acid probes of greater than 10 bp in length are 
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generally required to obtain the sequence diversity necessary to correctly identify a 
unique organism, disease state or clinical condition of interest. 
[0011] Several nucleic acid probe-based methods have been developed for the 
detection of nucleic acids. Most are designed around the amplification of selected 
targets and/or probes composed of DNA, including the polymerase chain reaction 
(PCR), ligase chain reaction (LCR), self-sustained sequence replication (3SR), nucleic 
acid sequence based amplification (NASBA), strand displacement amplification 
(SDA), amplification with Q.beta. replicase (Birkenmeyer and Mushahwar, J. 
Virological Methods, 35:117-126 (1991); Landegren, Trends Genetics, 9:199-202 
(1993)), rolling circle amplification (RCA), and linear rolling circle amplification 
(LRCA), which uses a primer annealed to the circular target DNA molecule before 
adding DNA polymerase. 

[0012] However, these methods are associated with a number of difficulties, 
including relatively low precision in quantitative measurements, being laborious, 
expensive, time-consuming, inefficient, and lacking in sensitivity. In addition, there 
are sources of error in such methods, such as where structural differences lead to 
different efficiencies. 

[00 1 3] Any hybridization of a nucleic acid probe to a closely related non-target 
sequence will result in the generation of undesired background signal. Because the 
sequences are so closely related, point mutations are some of the most difficult of all 
nucleic acid modifications to detect using a probe-based assay. Numerous diseases, 
such as sickle cell anemia and cystic fibrosis, are caused by a single point mutation of 
genomic nucleic acid. 
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[0014] Furthermore, all of these methods suffer from a lack of sensitivity, especially 
to rare genetic events, such as infrequent mutations. 

[0015] In addition, the detection of nucleic acids in vivo, including in real time, has 
presented a significant challenge. For example, demonstrating hybridization between 
an antisense oligonucleotide and its mRNA target has proven extremely difficult in 
living cells. The development of molecular beacon technology has provided one new 
approach to nucleic acid detection in vivo. See Sokol et al., Proc. Natl. Acad. Sci. 
USA 95:11538-43 (1998); Perlette et al., Anal. Chem. 73:657 A (2001). 

[0016] Accordingly ^ it would be highly desirable to have improved methods for the 
detection of target nucleic acids, including methods for in vivo detection. Such 
improved detection of DNA and RNA targets would be useful in the detection, 
analysis and quantitation of nucleic acid containing samples. 

SUMMARY OF THE INVENTION 

[0017] The present invention is directed to methods for in vitro and in vivo detection 
of target nucleic acid molecules, including DNA and RNA targets, as well as nucleic 
acid analogues. In the method of the present invention, a target nucleic acid is 
detected by its binding to a complementation complex, wherein the complementation 
complex comprises a first complementation molecule and a second complementation 
molecule. The first complementation molecule comprises a first polypeptide portion 
coupled to a first probe portion, which specifically interacts with the target nucleic 
acid; the second complementation molecule comprises a second polypeptide portion 
coupled to a second probe portion, which specifically interacts with the target nucleic 
acid in close proximity to the first probe. Binding of both probe portions to the target 
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nucleic acid brings the first complementation molecule and the second 
complementation molecule into close proximity, such that they are able to interact and 
form an assembled complementation complex, which can be detected by any means 
which allows recognition of the assembled complex but not the individual 
polypeptides. 

[0018] The target nucleic acid can be detected in vivo or in vitro. The target nucleic 
acid can be single-stranded or double-stranded. 

[0019] In one preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled, active 
enzyme, which can be detected using an enzyme activity assay. Preferably, the . 
enzyme activity is detected by a chromogenic or fluorogenic reaction. In one 
preferred embodiment, the enzyme is dihydrofolate reductase or p-lactamase. 
[0020] In another preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled protein 
with detectable fluorogenic activity. In one preferred embodiment, the assembled 
protein is green fluorescent protein (GFP) or a modified GFP such as EGFP, or any 
other natural or genetically engineered fluorescent protein such as CFG, YFP, or RFP. 

[0021] In another preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled protein 
which contains a discontinuous epitope, which may be detected by use of an antibody 
which specifically recognizes the discontinuous epitope on the assembled protein but 
not the partial epitope present on either individual polypeptide. 

[0022] The first probe portion and the second probe portion may be any molecules 
which interact specifically with a target nucleic acid in close proximity. In one 
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preferred embodiment, the probes are nucleic acids or nucleic acid analogues, such as 
oligonucleotides. In a further embodiment of the present invention, one or both of the 
probe portions is a nucleic-acid binding polypeptide, which interacts with the target 
nucleic acid with high affinity. 

[0023] In another preferred embodiment, the polypeptide portion of the molecule is 
coupled to the probe portion of the complementation molecule such that when the 
probe portions of the first molecule and the second complementation molecules are 
bound to the target nucleic acid, the two polypeptides of the first and second 
. complementation molecules can interact. For example, a flexible linker can be used 
to couple the probe portion and the polypeptide portion of the complementation 
molecule. In one preferred embodiment, the N-terminal amino acid and/or the C- 
terminal amino acid of each polypeptide is cysteine, and the probe has a thiol group at 
the 3' or 5* end, to allow coupling to the N-terminal and/or C -terminal cysteine. 
[0024] In one preferred embodiment, the target nucleic acid can be amplified prior 
to exposure to the components of the complementation complex. For example, rolling 
circle amplification (RCA) can be used to generate a single-stranded DNA target with 
a multiplicity of the same hybridization sites, which bind to the probes of the 
complementation complex. 

[0025] In one particularly preferred embodiment, the first and the second probes 
bind to two adjacent sequences on the target nucleic acid, such that the first probe 
binds to a first target sequence and the second probe binds to a second target 
sequence. In this embodiment, the adjacent sequences are close enough to each other 
to allow the first and second polypeptides to interact when the first and second 
complementation molecules are bound to the target, allowing assembly of the 
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complementation complex. This embodiment provides for detection of single- 
stranded and double-stranded target nucleic acids. For detection of double stranded 
targets, the single-stranded probes interact with the double-stranded target to form a 
triplex. 

[0026] In another particularly preferred embodiment, the first and the second probes 
are oligonucleotides and bind to the same sequence on a single-stranded target nucleic 
acid, forming a triplex. In this embodiment, the first and second polypeptides interact 
when the first and second complementation molecules are bound to the target, 
allowing assembly of the complementation complex. 

[0027] In embodiments providing for formation of a triplex, the probe can be an 
oligonucleotide or a polypeptide. Preferred triplex-forming oligonucleotides are GC- 
rich. A preferred triplex is a purine triplex, consisting of pyrimidine-purine-purine. 

[0028] One embodiment of the present invention provides in vivo detection of target 
nucleic acids. In this embodiment, interaction of the first and second polypeptides 
can be detected in vivo. For example, using a fluorogenic protein such as GFP or 
EGFP, which can be detected in living cells, or using a chomogenic substrate via an 
enzyme such as beta-galactosidase. 

[0029] In one embodiment, the present invention provides a method of detecting the 
presence and/or quantity of target nucleic acid present in a sample. A sample 
containing a target nucleic acid is contacted under hybridization conditions with the 
components of the complementation complex, namely the first molecule and the . 
second molecule. The presence and/or quantity of the assembled protein is indicative 
of the presence and/or quantity of the target nucleic acid. 
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[0030] The present invention also provides methods for isolating a target nucleic 
acid in a sample, even in the presence of non-target sequences. 

[003 1] In another preferred embodiment, the present invention provides kits suitable 
for detecting the presence and/or amount of a target nucleic acid in a sample. Said 
kits comprise at least the components of the complementation complex, namely a first 
probe coupled to a first molecule and a second probe coupled to a second molecule, 
wherein the probes can bind to a target nucleic acid in close proximity. Said kits aiso 
comprise reagents suitable for capturing and/or detecting the present or amount of 
target nucleic acid in a sample. The reagents for detecting the present and/or amount 
of target nucleic acid can include enzymatic activity reagents or an antibody specific 
for the assembled protein. The antibody can be labeled. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0032] Figure 1 shows the chromatographic profile of a conjugation mixture (Alpha- 
cys plus oligonucleotide Bl) on a Sephadex G-25 column. Alpha-cys fragment (1.6 
uM) was mixed with oligonucleotide Bl (50 uM) in 500 ul of buffer M (50 mM 
HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA, 10 mM DTT, 0.5 mM PMSF, 10 mM 
Tris-Cl) and left on ice for 3-4 hrs. Afterwards, the sample was applied by gravity 
flow to a G-25 Sephadex column (NAP-5, Pharmacia) equilibrated with buffer C (50 
mM HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA, 0.5 mM PMSF). Fractions (~ 0.17 
ml)) were collected at room temperature. 

[0033] Figure 2 shows the results of DNA assisted protein complementation. 

Mixing two fragments of EGFP subjected to conjugation with complementary DNA 

oligonucleotides restores fluorescent activity of the protein. Figure 2A: Buffer F (20 
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mM Tris-Cl ph 8.0, 50 mM NaCl, 0.1% Triton X-100). Figure 2B: Purified EGFP 
(1 1 .4 nM) Figure 2C: Alpha and Beta fragments from a mock conjugation reaction 
without DNA Figure 2D: Alpha hybrid containing oligonucleotide Bl mixed in equal 
volume (7.5 pi) with Beta hybrid containing a parallel complementary oligonucleotide 
B2. Figure 2E: Alpha hybrid containing oligonucleotide Bl mixed in equal volume 
(7.5 jil) with Beta hybrid containing an antiparallel complementary oligonucleotide 
(APB2). Figure 2F: Mixture of complementary parallel oligonucleotide (Bl-B2)(20 
|jM) without proteins. Figure 2G: Mixture of complementary antiparallel 
oligonucleotide (B1-APB2) (20 |iM) without proteins. 

[0034] Figure 3 shows the results of parallel versus antiparallel orientation and 
stability of a DNA assisted protein complementation assay. 24 hours after the initial 
measurements described in Figure 2, the mixtures that produced fluorescence (Alpha- 
DNA plus Beta-DNA hybrids) were serially diluted in buffer F (20 mM Tris-Cl, pH 
8.0 9 50 mM NaCl, 0.1% Triton X-100) and assayed for fluorescence. Figure 3 A: 
Buffer. Figure 3B: Alpha hybrid containing oligonucleotide Bl mixed in equal 
volume (7.5 |il) with Beta hybrid containing a parallel complementary oligonucleotide 
(B2) diluted 1/20. Figure 3C: Alpha hybrid containing oligonucleotide Bl mixed in 
equal volume (7.5 \x\) with Beta hybrid containing a parallel complementary 
oligonucleotide (B2) diluted 1/10. Figure 3E: Alpha hybrid containing 
oligonucleotide Bl mixed in equal volume (7.5 jxl) with Beta hybrid containing an 
antiparallel complementary oligonucleotide (APB2) diluted 1/20. Figure 3E: Alpha 
hybrid containing oligonucleotide Bl mixed in equal volume (7.5 pi) with Beta 
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hybrid containing an antiparallel complementary oligonucleotide (APB2) diluted 
1/10. 

[0035] Figure 4 depicts several schematics of the protein-complementation assay 
supported by nucleic acid interactions. Proteins or protein fragments are labeled with 
short oligonucleotides capable of duplex or triplex complex formation. The design of 
the oligonucleotide probes allows their complementary interactions or triplex 
formation. As shown in Figure 4A, due to complementary interactions, two inactive 
protein fragments come in close contact and undergo conformational changes, 
restoring enzymatic activity , which is detected by a chromogenic or fluorogenic 
reaction. Different types of complementation complexes can be used in this approach. 
Figure 4B shows the use of DNA duplexes. Figure 4C shows duplex and triplex 
formation by nucleic acid mimics. Finally, the inactive protein fragments can be 
fused to DNA- or RNA- recognizing peptides, which interact with target DNA or 
RNA with high affinity. These interactions will also bring two inactive protein 
fragments in close contact, restoring activity and thus signaling the presence of the 
target nucleic acid. 

[0036] Figure 5 depicts the protein-complementation assay supported by triplex 
DNA formation and applied in rolling circle amplification (RCA). Figure 5 A depicts 
synthesis of single-stranded RCA product with triplex-forming regions. Figure 5B 
depicts an in vitro assay to test the restoration of the DHFR activity (or p-lactamase 
activity). 

DETAILED DESCRIPTION OF THE INVENTION 
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[0037] We have now discovered methods to improve the specificity, sensitivity and 
reliability of probe-based assays both in vivo and in vitro. Such improved detection of 
DNA and RNA targets is useful in the detection, analysis and quantitation of nucleic 
acid containing samples. 

[0038] The present invention is based on protein complementation, in which two 
individual polypeptides are inactive. When the two inactive polypeptide fragments 
are brought in close proximity, however, they are able to re-associate into an active 
protein. 

[0039] The present invention is directed to methods for in vitro and in vivo detection 
of target nucleic acid molecules, including DNA and RNA targets, as well as nucleic 
acid analogues. In the method of the present invention, a target nucleic acid is 
detected by its binding to a complementation complex, wherein the complementation 
complex comprises a first complementation molecule and a second complementation 
molecule. The first complementation molecule comprises a first polypeptide portion 
coupled to a first probe portion, which specifically interacts with the target nucleic 
acid; the second complementation molecule comprises a second polypeptide portion 
coupled to a second probe portion, which specifically interacts with the target nucleic 
acid in close proximity to the first probe portion. Simultaneous binding of both probe 
portions to the target nucleic acid brings the first complementation molecule and the 
second complementation molecule into close proximity, such that their polypeptide 
portions are able to interact and form an assembled complementation complex, which 
can be detected by any means which allows recognition of the assembled complex but 
not the individual polypeptides. 
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[0040] In one particularly preferred embodiment, the first and the second probes 
bind to two adjacent sequences on the target nucleic acid, such that the first probe 
binds to a first target sequence and the second probe binds to a second target 
sequence. In this embodiment, the adjacent sequences are close enough to each other 
to allow the first and second polypeptides to interact when the first and second 
molecules are bound to the target, allowing assembly of the complementation 
complex. This embodiment provides for detection of single-stranded and double- 
stranded target nucleic acids. For detection of double stranded targets, the single- 
stranded probes interact, with the double-stranded target to form a triplex. 
[0041 ] In another particularly preferred embodiment, the first and the second probes 
are oligonucleotides and bind to the same sequence on a single-stranded target nucleic 
acid, forming a triplex. In this embodiment, the first and second polypeptides interact 
when the first and second molecules are bound to the target, allowing assembly of the 
complementation complex. 
Complementation complex components 

[0042] The complementation complex comprises a pair of complementation 
molecules, each of which is has a probe portion, for binding to the target nucleic acid, 
coupled to a polypeptide portion, for protein complementation. The individual 
polypeptides are inactive; however, binding of the pair of complementation molecules 
to the target nucleic acid in close proximity brings the two inactive polypeptide 
fragments together, allowing their re-association in to an active protein. 

[0043] The polypeptide portions of each member of the complementation complex 
can be any polypeptides which associate when brought in to close proximity to 
generate a protein, which can be detected by any means which allows recognition of 
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the assembled complex but not the individual polypeptides. For example, the two 
polypeptides may re-associate to generate a protein with enzymatic activity, to 
generate a protein with chromogenic or fluorogenic activity, or which create a protein 
recognized by an antibody. 

[0044] The probe portion of each complementation molecule can be any molecule 
which allows binding to a target nucleic acid at a specific sequence. Preferred probe 
portions include nucleic acids, nucleic acid analogues, and polypeptides. In one 
preferred embodiment, the probe portion is an oligonucleotide. The probe portion of 
the complementation molecule is sometimes referred to simply as the probe. The 
probe portions of a given pair of complementation molecules can be of the same kind 
of molecule, for example oligonucleotides, or they can be different, for example one 
member of the pair has a nucleic acid probe, and the other member of the pair has a 
polypeptide probe. 

[0045] The polypeptide portion and the probe portion of each complementation 
molecule can be coupled by any means which allows the polypeptide portion to 
interact with its corresponding polypeptide partner when the probe portion to bind to a 
target nucleic acid, allowing assembly of the complementation complex. 

[0046] One particularly preferred embodiment of the present invention provides for 
detection of target nucleic acids in vivo. In this embodiment, the complementation 
molecules can be introduced into cells using any method which allows their 
localization at the appropriate site. Delivery methods include microinjection, 
liposome-mediated uptake, and in vivo synthesis of the complementation molecule by 
the cell to be targeted. Specific targeting can be achieved for example using 
subcellular localization markers, which target a protein to a specific compartment 
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such as the nucleus or cytoplasm. Preferred methods include microinjection and in 
vivo synthesis of the complementation molecule by the cell to be targeted. 
[0047] The present invention is directed to using protein complementation to detect 
the presence of a target nucleic acid, such as DNA or RNA. Previous uses of protein 
complementation have been used to look specifically at protein interactions, not 
nucleic acids. Nucleic acids have been used only to bring two polypeptides together 
to detect the presence of a protein. For example, the protein complementation assay 
of Michnick has been used to look directly at protein interactions, such as re-assembly 
of an enzyme, DHFR: Pelletier et al., Proc. Natl Acad. Set USA 95: 12141-12146 
(1998); U.S. Patent Application 20030049688. However, unlike the present 
invention, there is no nucleic acid component in Michinik, and the assay is not 
designed for detection of nucleic acids. Similarly, Landegren teaches a method of 
detecting an analyte using "proximity probes" which contain interacting proteins and 
nucleic acids. U.S. Patent Application No. 20020064779. However, unlike the 
present invention, in the Landegren approach, there is no protein complementation. 
Furthermore, the Landegren approach is essentially the opposite of the present 
invention, because the nucleic acids are brought into close proximity by the protein- 
protein interactions rather than the proteins being brought into close proximity by the 
nucleic acid interactions. 

Polypeptide portions of the complementation molecule 

[0048] The polypeptide portions of each member of the complementation complex 
can be any polypeptides which associate when brought in to close proximity to 
generate a protein, which can be detected by any means, which allows recognition of 
the assembled complex but not the individual polypeptides. 
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[0049] In one preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled, active 
enzyme, which can be detected using an enzyme activity assay. Preferably, .the 
enzyme activity is detected by a chrdmogenic or fluorogenic reaction. In one 
preferred embodiment, the enzyme is dihydrofolate reductase or p-lactamase. 

[0050] In one preferred embodiment, the enzyme is dihydrofolate reductase 
(DHFR). For example, Michnick et al. have developed a "protein complementation 
assay" consisting of N- and C-terminal fragments of DHFR, which lack any 
enzymatic activity alone, but form a functional enzyme when brought into close 
proximity. See e.g. U.S. Patent Nos. 6,428,951, 6,294,330, and 6,270,964, which are 
hereby incorporated by reference. Methods to detect DHFR activity, including 
chromogenic and fluoregenic methods, are well known in the art. 

[0051] In another preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled protein 
with detectable fluorogenic activity. In one preferred embodiment, the assembled 
protein is green fluorescent protein (GFP), a modified GFP such as EGFP, or any 
other natural or genetically engineered fluorescent protein including CFP, YFP, and 
RFP. 

[0052] In another preferred embodiment of the invention, the first and second 
polypeptides interact in the complementation complex to form an assembled protein 
which contains a discontinuous epitope, which may be detected by use of an antibody 
which specifically recognizes the discontinuous epitope on the assembled protein but 
not the partial epitope present on either individual polypeptide. One such example of 
a discontinuous epitope is found in gpl20 of HIV. These and other such derivatives 
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can readily be made by the person of ordinary skill in the art based upon well known 
techniques, and screened for antibodies that recognize the assembled protein by 
neither protein fragment on its own. 

[0053] In another embodiment of the invention, the molecules are proteins which 
interact to form an assembled protein. For example, the molecules may be protein 
fragments, or subunits of a dimer or multimer. 

[0054] The codons used to synthesize the protein of interest may be optimized, 
converting them to codons that are preferentially used in a desired system. For 
example in mammalian cells. Optimal codons for expression of proteins in non- 
mammalian cells are also, known, and can be used when the host cell is a non- 
mammalian cell (for example in insect cells). 

[0055] The molecules bound to the probes of the present invention can contain any 
additional modifications which are desirable. For example, in one preferred 
embodiment, the molecules may also comprise a flexible linker, which is coupled to 
the probe. 

Probe portions of the complementation molecule 

[0056] The first probe portion and the second probe portion may be any molecules 
that can be coupled to another molecule, such as a polypeptide, and are capable of 
binding to a target nucleic acid in close proximity. In one preferred embodiment, the 
probes are nucleic acids or nucleic acid analogues, such as oligonucleotides. In 
another preferred embodiment of the present invention, one or both of the probe 
portions is a nucleic-acid binding polypeptide, which interacts with the target nucleic 
acid with high affinity. Other preferred nucleic acid analogues include peptide 



17 



WO 2004/033485 



PCTAJS2003/032426 



nucleic acids (PNAs), locked nucleic acids, morpholin DNAs, phosphorthioate DNAs, 
and 2 '-O-methoxymethyl-RNAs. 

[0057] One embodiment of the present invention provides for probe portions which 
are nucleic acids, which bind in close proximity to each other on the target nucleic 
acid. The first and second probe portion of the complementation molecules may bind 
to the same hybridization site on a single-stranded target, creating a triplex at the 
hybridization site. Alternatively, the first and second probe portion of the 
complementation molecules may bind to closely adjacent hybridization sites on a 
single-stranded or double-stranded target nucleic acid, creating either a duplex or a 
triplex at each hybridization site, respectively. 

[0058] The length of the probe portion should be long enough to allow 
complementary binding to the nucleic acid target, and should allow the polypeptide 
portion of the first complementation molecule to interact with its corresponding 
polypeptide portion of the second complementation molecule when both probe 
portions are bound to the same target nucleic acid. For example, the probe can be 5 — 
30 bases long. More preferably, 5-15 bases long. 

[0059] In embodiments providing for formation of a triplex, the probe portion can 
be any nucleic acid which allows triplex formation. Preferred triplex-forming 
oligonucleotides are GC-rich. A preferred triplex is a purine triplex, consisting of 
pyrimidine-purine-purine . 

[0060] One preferred triplex-forming oligonucleotide is GC-rich. A preferred 
triplex is a purine triplex, consisting of pyrimidine-purine-purine. 
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[0061] Preferred nucleic acid probe portions are oligonucleotides, single stranded 
RNA molecules, and peptide nucleic acids (PNAs) including pseudocomplementary 
PNAs (pcPNA). 

[0062] In one preferred embodiment, the nucleic acid probe portions are 
oligonucleotides. Methods for designing and synthesizing oligonucleotides are well 
known in the art. Oligonucleotides are sometimes referred to as oligonucleotide 
primers. 

[0063] Oligonucleotides useful in the present invention can be synthesized using 
established oligonucleotide synthesis methods. Methods of synthesizing 
oligonucleotides are well known in the art. Such methods can range from standard 
enzymatic digestion followed by nucleotide fragment isolation (see for example, 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor, N.Y., (1989), Wu et al, Methods in Gene Biotechnology (CRC Press, 
New York, N.Y., 1997), and Recombinant Gene Expression Protocols, in Methods in 
Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, N.J., 1997), the 
disclosures of which are hereby incorporated by reference), to purely synthetic 
methods, for example, by the cyanoethyl phosphoramidite method using a Milligen or 
Beckman System lPlus DNA synthesizer (for example, Model 8700 automated 
synthesizer of Milligen-Biosearch, Burlington, Mass. or ABI Model 380B). Synthetic 
methods useful for making oligonucleotides are also described by Dcuta et al., Ann. 
Rev. Biochem. 53:323-356 (1984), (phosphotriester and phosphite-triester methods), 
and Narang et al., Methods EnzymoL, 65:610-620 (1980), (phosphotriester method). 
[0064] As used herein, the terms "oligonucleotide" and "primer 11 have the 
conventional meaning associated with it in standard nucleic acid procedures, i.e., an 
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oligonucleotide that can hybridize to a polynucleotide template and act as a point of 
initiation for the synthesis of a primer extension product that is complementary to the 
template strand. 

[0065] Many of the oligonucleotides described herein are designed to be 
complementary to certain portions of other oligonucleotides or nucleic acids such that 
stable hybrids can be formed between them. The stability of these hybrids can be 
calculated using known methods such as those described in Lesnick and Freier, 
Biochemistry 34:10807-10815 (1995), McGraw et al., Biotechniques 8:674-678 
(1990), and Rychlik et al., Nucleic Acids Res. 18:6409-6412 (1990). 

[0066] In one preferred embodiment, the nucleic acid probe portions are single 
stranded RNA molecules. Methods for designing and synthesizing single stranded 
RNA molecules are well known in the art. 

[0067] In another preferred embodiment, the nucleic acid probe portions are peptide 
nucleic acids (PNAs), including pseudocomplementary PNAs (pcPNA). Methods for 
designing and synthesizing PNAs and pcPNAs are well known in the art. Peptide 
nucleic acids (PNAs) are analogs of DNA in which the backbone is a pesudopeptide 
rather than a sugar. Thus, their behavior mimics that of DNA and binds 
complementary nucleic acid strands. In peptide nucleic acids, the deoxyribose 
phosphate backbone of oligonucleotides has been replaced with a backbone more akin 
to a peptide than a sugar phosphodiester. Each subunit has a naturally occurring or 
non naturally occurring base attached to this backbone. One such backbone is 
constructed of repeating units of N-(2-aminoethyl)glycine linked through amide 
bonds. 



20 



WO 2004/033485 



PCT7US2003/032426 



[0068] PNA binds both DNA and RNA. The resulting PNA/DNA or PNA/RNA 
duplexes are bound with greater affinity and increased specificity than corresponding 
DNA/DNA or DNA/RNA duplexes. In addition, their polyamide backbone (having 
appropriate nucleobases or other side chain groups attached thereto) is not recognized 
by either nucleases or proteases, and thus PNAs are resistant to degradation by 
enzymes, unlike DNA and peptides. The binding of a PNA strand to a DNA or RNA 
strand can occur in either a parallel of anti-parallel orientation. PNAs bind to both 
single stranded DNA and double stranded DNA. 
[0069] To address the sequence limitations of traditional PNAs, 
pseudocomplementary PNAs (pcPNAs) have been developed. In addition to guanine 
and cytosine, pcPNA's carry 2,6-diaminopurine (D) and 2-thiouracil instead of 
adenine and thymine, respectively. pcPNAs exhibit a distinct binding mode, double- 
duplex invasion, which is based on the Watson-Crick recognition principle 
supplemented by the notion of pseudocomplentarity. pcPNAs recognize and bind 
with their natural A, T, (U), or G, C counterparts. pcPNAs can be made according to 
any method known in the art. For example, methods for the chemical assembly of 
PNAs are well known (See: U.S. Pat Nos. 5,539,082, 5,527,675, 5,623,049, 
5,714,331, 5,736,336, 5,773,571 or 5,786,571, herein incorporated by reference). 
[0070] Other embodiments of the invention.provide a probe portion which is a 
polypeptide. The polypeptide can be any polypeptide with a high affinity for the 
target nucleic acid. In this embodiment, the target nucleic acid can be a double- 
stranded, triple-stranded, or single-stranded DNA or RNA. Preferably, the 
polypeptides is a peptide, less than 100 amino acids, or a full length protein. The 
polypeptide's affinity for the target nucleic acid can in the low nanomolar to high 

21 



WO 2004/033485 



PCT7US2003/032426 



picomolar range. Preferred polypeptides include polypeptides which contain zinc 
fingers, either natural or designed by rational or screening approaches. Examples of 
zinc fingers include Zif 2g8, Spl, finger 5 of Gfi-1, finger 3 of YY1, finger 4 and 6 of 
CF2II, and finger 2 of TTK (PNAS (2000) 97: 1495-1500; J Biol Chem (20010 276 
(21): 29466-78; Nucl Acids Res (2001) 29 (24) :4920-9; Nucl Acid Res (2001) 
29(1 1): 2427-36). Other preferred polypeptides include polypeptides, obtained by in 
vitro selection, that bind to specific nucleic acids, sequences. Examples of such 
aptamers include platelet-derived growth factor (PDGF) (Nat Biotech (2002) 20:473- 
77) and thrombin (Nature(1992) 355: 564-6. Yet other preferred polypeptides are 
polypeptides which bind to DNA triplexes in vitro; examples include members of the 
heteronuclear ribonucleic particles (hnRNP) proteins such as hnRNP K, L, El , A2/B1 
and I (Nucl Acids Res (2001)29(1 1): 2427-36). 

[007 1 ] For complementation molecules which have a polypeptide for the probe 
portion, the entire complementation molecule can be encoded by a single construct, 
including the polypeptide portion, a linker and the probe portion which is also a 
polypeptide. This construct can either be expressed in the cell or microinjected into 
the cell. These constructs can also be used for in vitro detection of a nucleic acid of 
interest. 

Coupling mechanisms 

[0072] The complementation molecules of the present invention have the 
polypeptide portion of the molecule coupled to the probe portion of the 
complementation molecule, such that when the probe portions of the first molecule 
and the second complementation molecules are bound to the target nucleic acid, the 
two polypeptides of the first and second complementation molecules can interact. 
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The coupling mechanism for joining the probe portions to the polypeptides portions 
can be any mechanism which allows these interactions. 

[0073] In one preferred embodiment, a flexible linker can be used to couple the 
probe portion and the polypeptide portion of the complementation molecule. In one 
preferred embodiment, the N-terminal amino acid and/or the C-terminal amino acid of 
each polypeptide is cysteine, and the probe has a thiol group at the 3' or 5 ' end, to 
allow coupling to the N-terminal and/or C-terminal cysteine. In another preferred 
embodiment, both the probe portion and the polypeptide portion are polypeptides, and 
they are coupled by N-terminal and/or C-terminal cysteines. In yet another preferred 
embodiment, a streptavidin-biotin interaction is used to couple the probe, portion and 
the polypeptide portion of the complementation molecule. 

[0074] Coupling may be accomplished by any chemical reaction that will bind the 
probe portion to the polypeptide portion, as long as the probe portion remains able to 
bind the hybridization site, and the polypeptide portion is able to interact with its 
partner to form an assembled molecule. This linkage can include for instance 
covalent binding, affinity binding, intercalation, coordinate binding and complexation. 
In one preferred embodiment, the coupling is performed in vitro and the binding is 
covalent. For example, for a protein, covalent binding can be achieved either by 
direct condensation of existing side chains or by the incorporation of external bridging 
molecules. Many bivalent or polyvalent linking agents are useful in coupling protein 
molecules to other molecules such as the probes of the present invention. For 
example, representative coupling agents can include organic compounds such as 
thioesters, carbodiimides, succinimide esters, disocyanates, glutaraldehydes, 
diazobenzenes and hexamethylene diamines. This listing is not intended to be 
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exhaustive of the various classes of coupling agents known in the art but, rather, is 
exemplary of the more common coupling agents (see Killen and Lindstrom, J. 
Immunol 133:1335-2549, 1984; Jansen, F. K., etal, Imm. Rev. 62:185-216, 1982; 
and Vitetta et al, supra). 

[0075] Preferred linkers are described in the literature. See, for example, 
Ramakrishnan, S., et al, Cancer Res. 44: 201-208 (1984), describing the use of MBS 
(M-maleimidobenzoylrN-hydroxysuccinimide ester). See also Umemoto et al, U.S. 
Patent 5,030,719, describing the use of a halogenated acetyl hydrazide derivative 
coupled to an antibody by way of an oligopeptide linker. Particularly preferred 
linkers include: (i) EDC (l-ethyl-3-(3-dimethylamino-propyl) carbodiimide 
hydrochloride; (ii) SMPT (4-succinimidyloxycarbonyl-alpha-methyl-alpha-(2- 
pyridyl-dithio)-toluene (Pierce Chem. Co., Cat. (21558G); (iii) SPDP (succinimidyl-6 
[3-(2-pyridyldithio) propionamido] hexanoate (Pierce Chem. Co., Cat #21651G); 
(iv) Sulfo-LC-SPDP (sulfosuccinimidyl 6 [3-(2-pyridyldithio)-propianamide] 
hexanoate (Pierce Chem. Co. Cat. #2165-G); and (v) sulfo-NHS (N-hydroxysulfo- 
succinimide: Pierce Chem. Co., Cat. #24510) conjugated to EDC. 
[0076] The linkers described above contain components that have different 
attributes, thus leading to conjugates with differing physio-chemical properties. For 
example, sulfo-NHS esters of alkyl carboxylates are more stable than sulfo-NHS 
esters of aromatic carboxylates. NHS-ester containing linkers are less soluble than 
sulfo-NHS esters. Further, the linker SMPT contains a sterically hindered disulfide 
bond, and can form conjugates with increased stability. Disulfide linkages, are in 
general, less stable than other linkages because the disulfide linkage is cleaved m 
vitro, resulting in less conjugate available. Sulfo-NHS,. in particular, can enhance the 
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stability of carbodimide couplings. Carbodimide couplings (such as EDC) when used 
in conjunction with sulfo-NHS, forms esters that are more resistant to hydrolysis than 
the carbodimide coupling reaction alone. 

[0077] In one preferred embodiment, the molecules are proteins which contain an 
N-terminal and/or C-terminal cysteine, which can be coupled to an oligonucleotide 
carrying a thiol group at either the 3* or 5 'end and a donor or acceptor at the 5* or 3' 
end, respectively. In this embodiment it is desirable to substitute any other cysteines 
in the protein to other amino acids. 
Nucleic acid target 

[0078] The method of the present invention can be used to detect the presence of a 
single-stranded nucleic acid target or a double-stranded nucleic acid, by generating a 
detectable signal associated with formation of the complementation complex. 

[0079] The nucleic acid target can- be any nucleic acid which contains hybridization 
sites for binding to the first and second probes. For example, the target nucleic acid 
can be a DNA, an RNA, or a nucleic acid analogue. 

[0080] The target nucleic acid can be single-stranded or double-stranded. 

[0081] The target nucleic acid can be detected in vivo ox in vitro. In one preferred 
embodiment, the method of the present invention is used to detect a target nucleic 
acid in vzvo, and the polypeptides interact to generate a protein with fluorogenic 
activity. Preferably, the polypeptides encode GFP, a modified GFP such as EGFP, or 
any other natural or genetically engineered fluorescent proteins including CFP, YFP, 
andRFP. 

[0082] In one particularly preferred embodiment, the first and the second probes 
bind to two adjacent sequences on the target nucleic acid, such that the first probe 
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binds to a first target sequence and the second probe binds to a second target 
sequence. In this embodiment, the adjacent sequences are close enough to each other 
to allow the first and second polypeptides to interact when the first and second 
complementation molecules are bound to the target, allowing assembly of the 
complementation complex. This embodiment provides for detection of single- 
stranded and double-stranded target nucleic acids. For detection of double stranded 
targets, the single-stranded probes interact with the double-stranded target to form a 
triplex. 

[0083] In another particularly preferred embodiment, the first and the second probes 
are oligonucleotides and bind to the same sequence on a single-stranded target nucleic 
acid, forming a triplex. In this embodiment, the first and second polypeptides interact 
when the first and second complementation molecules are bound to the target, 
allowing assembly of the complementation complex. 

[0084] Any nucleic acid sample may be used in practicing the present invention, 
including without limitation eukaryotic, prokaryotic and viral DNA or RNA. In a 
preferred embodiment, the target nucleic acid represents a sample of genomic DNA 
isolated from a patient. This DNA may be obtained from any cell source or body 
fluid. Non-limiting examples of cell sources available in clinical practice include 
blood cells, buccal cells, cervicovaginal cells, epithelial cells from urine, fetal cells, or 
any cells present in tissue obtained by biopsy. Body fluids include blood, urine, 
cerebrospinal fluid, semen and tissue exudates at the site of infection or inflammation. 

■i 

In one preferred embodiment, the DNA is detected directly in the sample, without any 
additional purification. In another embodiment, the DNA is extracted from the cell 
source or body fluid using any of the numerous methods that are standard in the art. It 
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will be understood that the particular method used to extract DNA will depend on the 
nature of the source. The preferred amount of DNA to be extracted for use in the 
present invention is at least 5 pg (corresponding to about 1 cell equivalent of a 
genome size of 4x109 base pairs). 

[0085] In one preferred embodiment, the target nucleic acid can be amplified prior 
to exposure to the components of the complementation complex. Any method of 
amplifying a nucleic acid target can be used, including methods which generate a 
single stranded nucleic acid with a multiplicity of the same hybridization sites. The 
amplification reaction can be polymerase chain reaction (PCR), ligase chain reaction 
(LCR), strand displacement amplification (SDA), transcription mediated 
amplification (TMA), Q(3-replicase amplification (Q-beta), or rolling circle 
amplification (RCA). 

[0086] In one preferred embodiment, PCR is used to amplify the nucleic acid target. 

[0087] Any polymerase which can synthesize the desired nucleic acid may be used. 
Preferred polymerases include but are not limited to Sequenase, Vent, and Taq 
polymerase. Preferably, one uses a high fidelity polymerase (such as Clontech HF-2) 
to minimize polymerase-introduced mutations. 

[0088] In another preferred embodiment, rolling circle amplification (RCA) is used 
to generate a single-stranded DNA target with a multiplicity of the same hybridization, 
sites. Rolling circle amplification (RCA) is an isothermal process for generating 
multiple copies of a sequence. In rolling circle DNA replication in vivo, a DNA 
polymerase extends a primer on a circular template (Romberg, A. and Baker, T. A. 
DNA Replication, W. H. Freeman, New York, 1991). The product consists of 
tandemly linked copies of the complementary sequence of the template. RCA is a 
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method that has been adapted for use in vitro for DNA amplification (Fire, A. and Si- 
Qun Xu, Proc. Natl. Acad Sci. USA, 1995, 92:4641-4645; Lui, D., et al., J. Am. 
Chem. Soc, 1996, 118:1587-1594; Lizardi, P. M., et al., Nature Genetics, 1998, 
19:225-232; U.S. Pat. No. 5,714,320 to Kool). 

[0089] In RCA techniques a primer sequence having a region complementary to an 
amplification target circle (ATC) is combined with an ATC. Following hybridization, 
enzyme, dNTPs, etc. allow extension of the primer along the ATC template, with 
DNA polymerase displacing the earlier segment, generating a single stranded DNA 
product which consists of repeated tandem units of the original ATC sequence. 
[0090] RCA techniques are well known in the art, including linear RCA (LRCA). 
Any such RCA technique can be used in the present invention. Strand displacement 
during RCA can be facilitated through the use of a strand displacement factor, such as 
helicase. In general, any DNA polymerase that can perform rolling circle replication 
in the presence of a strand displacement factor is suitable for use in the processes of 
the present invention, even if the DNA polymerase does not perform rolling circle 
replication in the absence of such a factor. Strand displacement factors useful in RCA 
include BMRF1 polymerase accessory subunit (Tsurumi et al., J. Virology 
67(12):7648-7653 (1993)), adenovirus DNA-binding protein (Zijderveld and van der 
Vliet, J. Virology 68(2): 1 158-1 164 (1994)), herpes simplex viral protein ICP8 
(Boehmer and Lehman, J. Virology 67(2):71 1-715 (1993); Skaliter and Lehman, Proc. 
Natl. Acad. Sci. USA 91(22):10665-10669 (1994)), single-stranded DNA binding 
proteins (SSB; Rigler and Romano, J. Biol. Chem. 270:8910-8919 (1995)), and calf 
thymus helicase (Siegel et al., J Biol. Chem. 267:13629-13635 (1992)). The ability of 
a polymerase to carry out rolling circle replication can be determined by using the 
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polymerase in a rolling circle replication assay such as those described in Fire and Xu, 
Proc. Natl. Acad. Sci. USA 92:4641-4645 (1995) and in Lizardi (U.S. Pat No. 
5,854,033, e.g., Example 1 therein). 
Applications 

[0091] The method of the present invention can be used to detect the presence of a 
target nucleic acid of interest either in vivo or in vitro. Because the methods, kits and 
compositions of this invention are directed to the specific detection of target nucleic 
acid, even in the presence of non-target sequences, they are particularly well suited 
for the development of sensitive and reliable probe-based hybridization assays 
designed to analyze for point mutations. The methods, kits and compositions of this 
invention are also useful for the detection, quantitation or analysis of organisms 
(micro-organisms), viruses, fungi and genetically based clinical conditions of interest. 
[0092] In one embodiment, the present invention provides methods for isolating a 
target nucleic acid in . a sample, even in the presence of non-target sequences. 
[0093] One embodiment of the invention provides for the use of the 
complementation molecules to screen for a target nucleic acid. For example, to 
identify a target nucleic acid of interest in a population of other non-target nucleic 
acids. In this embodiment, the target nucleic acids or the complementation molecules 
of the present invention can be used in a form in which they are attached, by whatever 
means is convenient, to some type of solid support. Attachment to such supports can 
be by means of some molecular species, such as some type of polymer, biological or 
otherwise, that serves to attach said primer or ATC to a solid support so as to facilitate 
detection of tandem sequence DNA produced by rolling circle amplification using the 
methods of the invention. 
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[0094] Such solid-state substrates useful in the methods of the invention can include 
any solid material to which oligonucleotides can be coupled. This includes materials 
such as acrylamide, cellulose, nitrocellulose, glass, polystyrene, polyethylene vinyl 
acetate, polypropylene, polymethacrylate, polyethylene, polyethylene oxide, glass, 
polysilicates, polycarbonates, teflon, fluorocarbons, nylon, silicon rubber, 
polyanhydrides, polyglycolic acid, polylactic acid, polyorthoesters, 
polypropylfumerate, collagen, glycosaminoglycans, and polyamino acids. Solid-state 
substrates can have any useful form including thin films or membranes, beads, bottles, 
dishes, fibers, woven fibers, shaped polymers, particles and microparticles. A 
preferred form for a solid-state substrate is a glass slide or a microtiter dish (for 
example, the standard 96-well dish). For additional arrangements, see those described 
in U.S. Pat. No. 5,854,033. 

[0095] Methods for immobilization of oligonucleotides to solid-state substrates are 
well established. Oligonucleotides, including address probes and detection probes, 
can be coupled to substrates using established coupling methods. For example, 
suitable attachment methods are described by Pease et al., Proc. Natl. Acad. Sci. USA 
91(1 1):5022-5026 (1994). A preferred method of attaching oligonucleotides to solid- 
state substrates is described by Guo et al., Nucleic Acids Res. 22:5456-5465 (1994). 
[0096] In another preferred embodiment, the present invention provides kits suitable 
for detecting the presence and/or amount of a target nucleic acid in a sample. Said 
kits comprise at least a first probe coupled to a first molecule and a second probe 
coupled to a second molecule, wherein the probes can bind to a hybridization 
sequence in a target nucleic acid. Preferably, the probes are in vials. Said kits also 
comprise reagents suitable for capturing and/or detecting the present or amount of 
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target nucleic acid in a sample. The reagents for detecting the present and/or amount 
of target nucleic acid can include enzymatic activity reagents or an antibody specific 
for the assembled protein. The antibody can be labeled. Such kits may optionally 
include the reagents required for performing RCA reactions, such as DNA 
polymerase, DNA polymerase cofactors, and deoxyribonucleotide-5 -triphosphates. 
Optionally, the kit may also include various polynucleotide molecules, DNA or RNA 
ligases, restriction endonucleases, reverse transcriptases, terminal transferases, 
various buffers and reagents, and antibodies that inhibit DNA polymerase activity. 
These components are in containers, such as vials. The kits may also include reagents 
necessary for performing positive and negative control reactions, as well as 
instructions. Optimal amounts of reagents to be used in a given reaction can be 
readily determined by the skilled artisan having the benefit of the current disclosure. 
• [0097] The target nucleic acid can be of human origin. The target nucleic acid can 

be DNA or RNA. The target nucleic acid can be free in solution, or immobilized to a 
solid support. 

[0098] In one embodiment, the target nucleic acid is specific for a genetically based 
disease or is specific for a predisposition to a genetically based disease. Said diseases 
can be, for example, .beta.-Thalassemia, Sickle cell anemia or Factor- V Leiden, 
genetically-based diseases like cystic fibrosis (CF), cancer related targets like p53 and 
plO, or BRC-1 and BRC-2 for breast cancer susceptibility. In yet another 
embodiment, isolated chromosomal DNA may be investigated in relation to paternity 
testing, identity confirmation or crime investigation. 

[0099] The target nucleic acid can be specific for a pathogen or a microorganism. 
Alternatively, the target nucleic acid can be from a virus, bacterium, fungus, parasite 
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or a yeast; wherein hybridization of the complementation molecules to the target 
nucleic acid is indicative of the presence of said pathogen or microorganism in the 
sample. 

[00 1 00] The target nucleic acid may also be useful for the detection of bacteria and 
eucarya in food, beverages, water, pharmaceutical products, personal care products, 
dairy products or environmental samples. Preferred beverages include soda,, bottled 
water, fruit juice, beer, wine or liquor products. Assays developed will be particularly 
usefiil for the analysis of raw materials, equipment, products or processes used to 
manufacture or store food, beverages, water, pharmaceutical products, personal care 
products, dairy products or environmental samples. 

EXAMPLES 

EXAMPLE 1: DNA-assisted protein complementation using 21-mer complementary 
oligonucleotides as interacting partners and Enhanced Green Fluorescent Protein 
(EGFP^ 

[00101] DNA assisted protein complementation was assayed by mixing two protein- 
DNA hybrids based on the Enhanced Green Fluorescent Protein (EGFP) molecule. 
The following steps were done to perform the experiments. 

Cloning, expression and purification of polypeptides 

[001 02] The following genes were obtained by PCR and cloned in the TWIN-1 
vector (New England Biolabs, MA), as C-terminal fusion of Ssp DNAB intein. 
Construct 1 : Full EGFP-1 (Clontech, CA). Construct 2: a-cys polypeptide containing 
the 157 amino acid N-terminal portion of EGFP, plus a C-terminal cysteine. 
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Construct 3: p-cys fragment polypeptide containing the remaining 82 amino acid C- 
terminal portion of EGFP, plus an N-terminal cysteine. These constructs were 
verified by sequencing. 

[00103] The proteins were expressed in BL21(DE3)pLys competent cells (Stratagene, 
CA). Induction was done with IPTG and cells were allowed to grow for 4 hrs at 

30°C. Cells were lysed in Buffer S (50 mM HEPES pH 8.5, 0.5 M NaCl, 1 mM 
EDTA ,10 mM DTT, Triton X-100 0.1 %. 1 mM PMSF) by freeze and thaw method. 
After DNAse I treatment, samples were centrifuged at 13000 g for 30 minutes at 4°C. 
The supematants were applied to a chitin affinity chromatography resin (New 
England Biolabs, MA) equilibrated with Buffer B (50 mM HEPES pH 8,5, 0.5 M 
NaCl, 1 mM EDTA ,10 mM DTT, Triton X-100 0.1 %. 0.5 mM PMSF). After that, 
the column was washed extensively with the same buffer. Then the column was 
equilibrated in Buffer E (50 mM HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA ,10 mM 
DTT, Triton X-100 0.1 %. 0.5 mM PMSF) and let at 4°C for 24-48 hrs for the intein 
cleavage to take place. In the next step, eluates were collected and protein 
concentration and purity were analyzed by SDS-PAGE and Coomasie Plus Protein 
Assay Reagent (Pierce, IL). 

Conjugation of the EGFP fragments with the oligonucleotides 

[00104] DNA oligonucleotides with an SH 2 group at either the 5 or 3' end were 
purchased from Integrated DNA Technologies, IO. The oligonucleotides have the 
following sequences: 

Oligonucleotide Bl: 5 '-SH2-C6 — CGACTGCGTTAGC ATGTGTTG-3 " (SEQ ID 
NO:l). "C6" is a 6 carbon spacer. 
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Oligonucleotide B2: 5'-GCTGACGCAATCGTACACAAC-C3-C3-SH2-3' (SEQ ID . 
NO:2). B2 is complementary and parallel to Bl . "C3" is a 3 carbon spacer. 

Oligonucleotide AP-B2: 5 '-SH2-C6 — CAACACATGCTAACGCAGTCG-3 • (SEQ 
ID NO:3). AP-B2 is complementary and antiparallel to Bl . 

[001 05] The protein fragments (1 .6 uM) were mixed with an excess of 
oligonucleotides (50 uM) in 500 ul of Buffer M(50 mM HEPES pH 7.0, 0.5 M NaCl, 
1 mM EDTA ,DTT 10 mM, PMSF 0.5 mM, 10 mM Tris-Cl). They were paired as 
follows: a-cys fragment plus oligonucleotide Bl . (a -Bl , also referred to as AC-B 1). 
P-cys fragment plus oligonucleotide B2. (P~B2, also referred to as BC-B2). p-cys 
fragment plus oligonucleotide APB2. (P-APB2, also referred to as BC-APB2). P-cys 
fragment plus oligonucleotide Bl. (p-Bl, also referred to as BC-B1). 

[00106] After mixing the samples, they were kept on ice for 3-4 hours. Afterwards, 
the samples were applied to a NAP-5 column (Sephadex G-25 Medium DNA-grade) 
(Pharmacia) at room temperature by gravity flow, to remove the DTT and thus 
activate the SH2 groups in both molecules. The column was equilibrated with Buffer 
C (50 mM HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA, PMSF 0.5 mM ). Fractions 
were analyzed by UV spectroscopy at 260 and 280 nm. 

Results 

[00107] Figure 1 shows a typical chromatographic profile of 1/10 diluted samples 

from a conjugation experiment. In Figure 1, the chromatographic profile of a 

conjugation mixture (Alpha-cys plus oligonucleotide Bl) on a Sephadex G-25 

column. Alpha-cys fragment (1 .6 uM) was mixed with oligonucleotide Bl (50 uM) in 

500 ul of buffer M (50 mM HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA, 10 mM DTT, 
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0.5 mM PMSF, 10 mM Tris-Cl) and left on ice for 3-4 hrs is shown. Afterwards, the 
sample was applied by gravity flow to a G-25 Sephadex column (NAP-5, Pharmacia) 
equilibrated with buffer C (50 mM HEPES pH 7.0, 0.5 M NaCl, 1 mM EDTA, 0.5 
mM PMSF). Fractions (-0.17 ml)) were collected at room temperature. 

[00108] The chromatographic profiles were similar in all the conjugation 
experiments. Samples from the beginning of the first peak (usually fraction #4) were 
concentrated about 3 times in a centricon (Millipore, MA) at 4°C. Filters with a 10 
KD cut-off were used for the Alpha-DNA hybrids and filters with a 3 KD cut-off 
were used for the Beta-DNA hybrids. 

[00109] Reconstitution of functional EGFP via a DNA-DNA interaction 

[001 10] In a typical experiment, one protein-DNA hybrid (Alpha-Bl for instance) is 
mixed in equal volume (7.5 ul each hybrid) with its counterpart (Beta-B2) in a 1.5 ml 
microfuge tube. Immediately afterwards, this mix is added to 300 ul of buffer F (20 
mM Tris-Cl ph 8.0, 50 mM NaCl, 0.1 % Triton X-100) already in the cuvette. The 
solution is mixed by inversion twice and placed in the fluorimeter. 

[001 1 1] Excitation was made at 488 nm and the emission spectra recorded between 
490 and 550 nm. Scan speed was 60 nm/min, the excitation and emission slits were 
set at: 10 nm and the PMT voltage was 700 volts. Fluorescence measurements were 
done in a Spectrophotometer FL F-2500 fluorimeter (Hitachi) at room temperature. 
Samples were kept on ice before and after experiments. 

[001 12] Several controls were included in the experiments: a) Full EGFP; b) Alpha- 
cys and Beta-cys fragments without DNA, separately and combined; and c) Paired 
DNA oligonucleotides (B1-B2, B1-APB2) without proteins. 
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[001 13] Figure 2 shows the fluorescence spectra of full EGFP, and different mixtures 
of Alpha-cys and Beta-cys fragments and the conjugated protein-DNA fragments with 
the Bl, B2 and APB2 oligonucleotides. These results show that the combination of 
the Alpha fragment of EGFP subjected to conjugation with the Bl oligonucleotide 
(Alpha-Bl hybrid) and the Beta fragment of EGFP subjected to conjugation with the 
complementary antiparallel oligonucleotide APB2 (Beta-APB2 hybrid) generated a 
fluorescence significantly above background. Unexpectedly, we also observed a 
strong fluorescence signal when the Alpha-Bl hybrid was mixed with the Beta 
fragment of EGFP subjected to conjugation with the B2 oligonucleotide (Beta-B2 
hybrid), which is parallel to the Bl oligonucleotide. 

[001 14] In a separate series of experiments, it was shown that using the same 
oligonucleotide (Bl) conjugated with both the Alpha-cys and Beta-cys fragments of 
EGFP did not produce a fluorescent signal above the background given by the protein 
fragments alone. 

[001 1 5] Figure 3 shows the difference in maximum between parallel and antiparallel 
orientations of the DNA oligonucleotides. It should be emphasized that the complexes 
are stable at 4oC since the spectra shown in Figure 3 were recorded 24 hours after the 
complementation experiments. 

[001 1 6] Figure 3 shows the results of parallel versus antiparallel orientation and 
stability of a DNA assisted protein complementation assay. 24 hours after the initial 
measurements described in Figure 2, the mixtures that produced fluorescence (Alpha- 
DNA plus Beta-DNA hybrids) were serially diluted in buffer F (20 mM Tris-Cl, pH 
8.0, 50 mM NaCl, 0.1% Triton X-100) and assayed for fluorescence. Figure 3A: 
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Buffer. Figure 3B: Alpha hybrid containing oligonucleotide Bl mixed in equal 
volume (7.5 jil) with Beta hybrid containing a parallel complementary oligonucleotide 
(B2) diluted 1/20. Figure 3C: Alpha hybrid containing oligonucleotide Bl mixed in 
equal volume (7.5 pi) with Beta hybrid containing a parallel complementary 
oligonucleotide (B2) diluted 1/1 0. Figure 3E: Alpha hybrid containing 
oligonucleotide Bl mixed in equal volume (7.5 jal) with Beta hybrid containing an 
antiparallel complementary oligonucleotide (APB2) diluted 1/20. Figure 3E: Alpha 
hybrid containing oligonucleotide Bl mixed in equal volume (7.5 jul) with Beta 
hybrid containing an antiparallel complementary oligonucleotide (APB2) diluted 
1/10. 

[001 17] These experiments show that the presence in the mixture of protein-DNA 
hybrids with complementary oligonucleotides is necessary to produce a fluorescence 
signal above the background. This shows the feasibility of a protein complementation 
driven by DNA-DNA interactions provided by complementarity of 21 mer DNA 
duplexes. A more detailed biochemical characterization of the DNA-assisted protein 
complementation reaction is being done at our lab at the moment. 

EXAMPLE 2 Protein-complementation assay supported bv triplex DNA formation 
and applied in rolling circle amplification . 

[001 1 8] The general purpose of the present invention consists of developing a new 
universal and robust method for detection of DNA and RNA targets. The method can 
be also used in protein-protein interaction studies. One embodiment of the invention 
utilizes the protein fragment complementation assay (PCA). PCA relies on re- 
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association of two inactive protein fragments into an active enzyme if and only if 
there are additional interactions assisting protein re-association. In the original PCA 
method developed by Michnick these are protein-protein interactions, which support 
re-association of the marker enzyme (1). In our method, the enzyme activity relies on 
DNA-DNA or DNA-RNA or RNA-RNA interactions (Figures 4 and 5). 
[001 19] In one embodiment, a single-stranded DNA target is detected in the presence 
of two triplex forming oligonucleotides coupled with the inactive enzyme fragments 
such as dihydrofolate reductase, DHFR, or P-lactamase (Figure 5), or two inactive 
protein fragments such as EGFP (Figures 1 - 3). The triplex-forming oligonucleotide 
is GC-rich to make relatively stable complex. Purine triplexes (PyPuPu) fit most our 
purposes, because they are stable enough at neutral pH (2). 

[00120] In this embodiment, linear rolling circle amplification (RCA) is performed to 
generate single-stranded DNA product (3,4). The active enzyme is assembled from 
the fragments in the presence of the increasing amount of triplex-forming sequence 
produced by RCA, and its activity is detected by chromogenic or fluorogenic reaction. 
This assay differs substantially from the classic RCA because in this case, signal 
amplification is mostly gained by the enzymatic activity of the re-assembled protein. 
This format will allow for faster analysis then in conventional RCA and sensitivity 
comparable with hyper-branched RCA. 

[00121] To reduce this approach to practice, we will use N- and C-terminal fragments 

of dihydrofolate reductase (DHFR), the system developed by S. Michnick (Pelletier et 

al., 1998). We will synthesize DHFR peptides in vivo by expressing the corresponding 

plasmids in E.colL Each plasmid will code for the DHFR fragment, flexible peptide 

linker attached to the N-terminus, and the His-tag at the C-terminus. The N-terminal 
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amino acid will be designed to be cystein (the only cystein in DHFR, C7, will be 
substituted for Thr). The plasmids coding for the fragments of DHFR are available 
from the Michnick's lab. They will be modified if necessary by using PCR with 
corresponding primers. The peptides will be purified from E. coli cell lysates and 
coupled with oligonucleotides carrying thiol-group at the 3'- or 5 '-end and donor or 
acceptor at the 5' or 3' end, respectively. The by-products oligonucleotide- and 
peptide-dimers will be removed. 
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